Bridging Transient and Steady-State Performance in Voltage Control: A Reinforcement Learning Approach With Safe Gradient Flow

نویسندگان

چکیده

Deep reinforcement learning approaches are becoming appealing for the design of nonlinear controllers voltage control problems, but lack stability guarantees hinders their deployment in real-world scenarios. This paper constructs a decentralized RL-based controller inverter-based real-time distribution systems. It features two components: transient policy and steady-state performance optimizer. The is parameterized as neural network, optimizer represents gradient long-term operating cost function. parts synthesized through safe flow framework, which prevents violation reactive power capacity constraints. We prove that if output bounded monotonically decreasing with respect to its input, then closed-loop system asymptotically stable converges optimal solution. demonstrate effectiveness our method by conducting experiments IEEE 13-bus 123-bus test feeders.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Constrained adaptive control with transient and steady-state performance guarantees

Over the last decades research has been performed in order to improve the transient behavior of adaptive systems. To that end, this paper develops a new adaptive control architecture for uncertain dynamical systems to achieve guaranteed transient performance in the presence of state constraints. For this purpose, we extended a recently developed command governor method. Specifically, the comman...

متن کامل

A robust adaptive control architecture with L∞ transient and steady-state performance guarantees

In this paper, a new adaptive control architecture for nonlinear uncertain dynamical systems is developed to address the problem of high-gain adaptive control. Specifically, the proposed framework involves a new and novel controller architecture involving a modification term in the update law that minimizes an error criterion involving the distance between the weighted regressor vector and the ...

متن کامل

Analyzing Steady State and Transient State Performance of Transmission Control Protocol in the Internet

The Internet uses a window-based congestion control mechanism in TCP (Transmission Control Protocol). In the literature, there have been a great number of analytical studies on TCP. Most of those studies have focused on the statistical behavior of TCP by assuming a constant packet loss probability in the network. However, the packet loss probability, in reality, changes according to packet tran...

متن کامل

Safe Exploration of State and Action Spaces in Reinforcement Learning

In this paper, we consider the important problem of safe exploration in reinforcement learning. While reinforcement learning is well-suited to domains with complex transition dynamics and high-dimensional state-action spaces, an additional challenge is posed by the need for safe and efficient exploration. Traditional exploration techniques are not particularly useful for solving dangerous tasks...

متن کامل

Safe State Abstraction and Discounting in Hierarchical Reinforcement Learning

The great benefit in state abstraction for hierarchical reinforcement learning (HRL) is the potential improvement in computational complexity with significant compaction of the value function. Safe state aggregation of reusable sub-task states is not possible in general for a decomposed MDP using one decomposed discounted cumulative reward function. This severely limits the effectiveness of HRL...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Control Systems Letters

سال: 2023

ISSN: ['2475-1456']

DOI: https://doi.org/10.1109/lcsys.2023.3289435